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DETAILED ACTION 

1 . This Office Action is in response to Amendments and Remarks received 06/16/2005. 
Original claims 1-27 are pending. 

Drawings 

2. In view of receipt of Replacement Sheet for FIG. 4, the prior objection to the drawing is 
hereby withdrawn. 

Claim Objections 

3. Claim 26 is objected to for defining 'computer-readable medium' in the Specification 
(page 16, 2 nd paragraph) as a "data transmission or communications medium including packets of 
electronic data and electromagnetic or fiber optic waves modulated in accordance with the 
instructions." Claim 26 must be amended to recite a statutory, tangible embodiment. A 
suggested amendment is, "A volatile or non- volatile computer-readable medium. . ." 

Claim Rejections - 35 USC § 103 

4. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

5. Claims 1-27 are rejected under 35 U.S.C. 103(a) as being unpatentable over Patent 
Application Publication 2002/0199178 Al to Hobbs et al. 

1,26, and 27: 

-method . . . computer-readable medium. . . system. . . 
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([00 10],"... method . . . [0020], "... encoded on a variety of program storage media. . . (computer- 
readable medium)", [00 1 9], " . . . computer that may be used to implement the instant 
invention. . . (system)") 

-collecting information describing a frequency of occurrence of a plurality of cache 
misses caused by at least one instruction; 

([0066], "... analyze the program to determine where cache misses and/or cache thrashing is 
occurring, and then apply the optimization. . ." Hobbs invention is directed towards [0007-0009] 
reducing the effects of 'cache thrashing' as related to cache management / cache misses [0006- a 
portion not available when needed]. As an example of 'collecting information describing a 
frequency of occurrence of a plurality of cache misses' see [0022] where Hobbs disclosed 
"observing and analyzing (collecting information) the manner in which the code uses 
resources. . .cache (frequency of cache misses). . .The process then generates execution profile 
data. . ." Hobbs disclosed the intent to 'remove cache thrashing (due to cache misses) at [0033] 
by inserting prefetch and other cache management instructions. Another example at [0035] 
suggests loop restructuring in response to determining (from collected information of frequency 
of cache misses) that the memory references within the loop may cause cache thrashing (due to 
cache misses) to occur. [0066] discloses 'analyzing (collecting information) the program to 
determine where cache misses and/or cache thrashing is occurring. . .") 
-identifying a performance degrading instruction; 

([0067], "If execution profile data contains information about which loop bodies contain 
instructions that are encountering cache misses. ..(identify degrading instruction)") 
-optimizing the program to provide an optimized sequence of instructions, the 
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optimized sequence of instructions comprising at least one prefetch instruction; 
([0066], . .then apply the optimization techniques. . .") 
-modifying the program being executed to include the optimized sequence. 
([0067], . .profile data can be used to direct which loops are restructured. . .") 

Hobbs disclosed an invention that [0002] "restructures program loops to reduce cache thrashing. 
Hobbs failed to provide specific details related to 'frequency of occurrence of a plurality of 
cache misses', but did broadly disclose that analysis was performed (including analysis of 
caches) and used in the restructuring of code. Therefore, it would have been obvious, to one of 
ordinary skill in the art, at the time of the invention, to modify Hobbs invention to specifically 
disclose that 'frequency of occurrence of a plurality of cache misses' was a consideration in the 
analysis, because he specifically discussed techniques to minimize cache thrashing caused by 
cache misses at [0025-0033]. 

2. 

-the program comprises a plurality of sequence of instructions. 

([0010], "...identifying a loop in a program (sequence of instructions)...") 



3. 

-the performance degrading instruction contributes to highest frequency of occurrence of the 
plurality cache misses. 
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([0022], ". . .The process then generates execution profile data that represents the results of its 
observations and analysis of execution of the code. [0029], "The organization of the caches 
may contribute to or cause thrashing (cache misses)." Broadly Hobbs disclosed 'frequency of 
occurrence' by disclosing that an analysis is performed on the execution of the code. Cache 
misses are considered in the analysis.) 

4. 

-the performance degrading instruction contributes to highest degradation in the program 
performance. 

(([0022], ". . .The process then generates execution profile data that represents the results of its 
observations and analysis of execution (program degradation) of the code. . .", Broadly Hobbs 
disclosed 'highest degradation of performance' by disclosing that an analysis is performed on the 
execution of the code. Degradation is considered in the analysis.) 

Hobbs disclosed an invention that [0002] "restructures program loops to reduce cache thrashing. 
Hobbs failed to provide specific details related to 'highest degradation', but did broadly disclose 
that analysis was performed (for the purpose of finding instruction sequences that degrade 
program performance) and used in the restructuring of code. Therefore, it would have been 
obvious, to one of ordinary skill in the art, at the time of the invention, to modify Hobbs 
invention to specifically disclose that 'highest degradation' was a consideration in the analysis, 
because he specifically discussed techniques to minimize degrading instructions ([0051] 
. . . optimized u8sing. . . cache management instructions. . . "). 
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5. 

-wherein the at least one instruction is the performance degrading instruction. 
([0002], "The present invention relates generally to software compilers and, more particularly, to 
a compiler that restructures program loops to reduce cache thrashing." The invention finds 
degrading instructions in a program.) 

6. 

-optimizing the program comprises inserting the at least one prefetch instruction prior to the 
performance degrading instruction. 

([0021], . .The optimization processes may perform this transformation of the code. . .", [0033], 
"... the compiler operates to insert prefetch . . . ") 

7. 

-the plurality cache misses are L2/L3 cache misses. 

([0025-0027], "Two-level caches. ..","... if the requested data is not present in the first cache (a 
miss), then the request is passed to the second cache. . .", [0029], "The organization of the caches 
may contribute to or cause thrashing (cache misses). . ." Hobbs disclosed the LI and L2 caches 
and considers the cache misses as a potential degrading opportunity.) 

8. 

-the optimized sequence is prepared while the program is placed in a suspend mode. 
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([0022], ". . .the process generates data by initially executing the code. . . observing and 
analyzing. . .The process then generates execution profile data. . .The optimization process may 
then use the data to generate a new more efficient version (in a suspend mode). . .which is then 
processed. . .The code generator process translates the expanded intermediate code into 
instructions that are specific to the architecture. . .the generator modifies the code. . . execute (in a 
resume mode / change to an execution mode) . . . ") 

9. 

-modifying the program comprises: 

changing the program from the suspend mode to the execution mode. 
See rejection of limitations in claim 8 above. 

10. 

-optimizing the program comprises: 

-receiving information describing a dependency graph for the at least one instruction; 
([0036], ". . .performing use dependence analysis. . .") 

-determining whether a cyclic dependency pattern exists in the dependency graph; 

([0036], ". . .dependence analysis will identify memory operations that have some dependency 

upon one another. . . If there is a. . . cyclic dependency cycle. . . ") 
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-if the cyclic dependency pattern exists then, computing stride information derived from the 
cyclic dependency pattern; 

([0037], "It is envisioned that loop restructuring may be advantageously performed on memory 
arrays with longer strides. . .length of stride in an array that will benefit from loop 
restructuring. . . " Hobbs gave consideration to the stride information when restructuring loop 
instructions.) 

-inserting the prefetch instruction derived from the stride information, the prefetch 
instruction being inserted into the program prior to the performance degrading instruction. 
(0041], ". . .by including features such as . . .prefetch instructions. . [0051], "The computer may 
include a variety of prefetch. . . instructions that can migrate data. . . " Additionally see sample 
code in [0052] that incorporates prefetch instructions.) 

11. 

-the dependency graph is a backward slice from the performance degrading instruction. 
([0036], "... dependence analysis will identify memory operations that have some dependency 
upon one another. . .bi-directional dependency. . .uni-directional dependency. . . ." Hobbs gives 
consideration to modifying code by understanding the dependency and producing code that 
preserves the dependency.) 

12. 

-modifying the program comprises: 
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-storing the optimized sequence; 

([0019], "The computer may be used to compile a source program. . .to execute (a stored 
compiled program) a target program that has been compiled. [0023], The object code is then 
generated. . .linker the combines. . .to produce machine dependent code that is executable. . .to 
generate the execution profile data. . .used by the processes . . .if the code does not exhibit optimal 
execution performance. . . more optimal versions of the code may be generated." Hobbs 
generates optimized code, and stores it in preparation for execution.) 

-redirecting a sequence of instructions having the performance degrading instruction to include 
the optimized sequence. 

(As an example, see the code samples in [0052] which redirects a sequence of instructions, by 
using prefetch instructions.) 

13. 

A method of optimizing a program comprising a plurality of execution paths, the method 
comprising: 

-collecting information describing a plurality of occurrences of a plurality of cache miss events 
during a runtime mode of the program; 

([0022], "... observing and analyzing the manner in which the code uses resources (collecting 
information). . ." Cache miss events are analyzed in the runtime generation of profile data. See 
more details regarding cache misses at [0025-0033].) 
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-identifying a performance degrading execution path in the program; 

([0022], ". . .then generates execution profile data that represents the results of its observations 
and analysis of execution of the code (performance degrading identified).") 

-modifying the performance degrading execution path to define an optimized execution path, the 

optimized execution path comprising at least one prefetch instruction; 

([0022], ". . .then use the data to generate (modify) a new more efficient version. . .", [0033], 

. .restructuring the loop. . .insert (modifying) prefetch and other cache management 
instructions...") 

-storing the optimized execution path; 

([0022], . .the generator modifies the code such that the code reflects scheduling and other low- 
level optimizations of the code which are dependent on the architecture of the computer that will 
execute the code (modified code is stored and ready for execution) " 

-redirecting the performance degrading execution path in the program to include the optimized 
execution path. 

([0022], ". . .the generator modifies (redirects) the code such that the code reflects scheduling and 
other low-level optimizations of the code which are dependent on the architecture of the 
computer that will execute the code (redirected optimized code is stored and ready for 
execution).") 
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Hobbs disclosed an invention that [0002] "restructures program loops to reduce cache thrashing. 
Hobbs failed to provide specific details related to 'frequency of occurrence of a plurality of 
cache misses', but did broadly disclose that analysis was performed (including analysis of 
caches) and used in the restructuring of code. Therefore, it would have been obvious, to one of 
ordinary skill in the art, at the time of the invention, to modify Hobbs invention to specifically 
disclose that 'frequency of occurrence of a plurality of cache misses' was a consideration in the 
analysis, because he specifically discussed techniques to minimize cache thrashing caused by 
cache misses at [0025-0033]. 

14. 

-the plurality of cache miss events are caused by an execution of a plurality of performance 
degrading instructions. 

([0029], "The organization of the caches may contribute or cause thrashing (cache misses).") 
15. 

-identifying the performance degrading path comprises identifying a performance degrading 
instruction contributing to highest plurality of occurrences of cache miss events. 
([0022], "... observing and analyzing the manner in which the code uses resources. . .generates 
execution profile data that represents the results of. . .analysis (identify performance degrading 
path and highest plurality of occurrences of cache miss events). . .", [0033], ". . .the instant 
invention can also remove the cache thrashing. . .by restructuring the loop. . .Additionally, the 
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compiler operates to insert prefetch and other cache management instructions. . ." Hobbs 
disclosed that cache miss events may contribute to program degradation. See [0025-0033].) 

16-18: 

-the optimized execution path is defined while placing the program in a suspend mode from the 
runtime mode. 

-the optimized execution path is executed on resuming the runtime mode of the program code 
from the suspend mode. 

-redirecting the performance degrading execution path comprises: 
-changing the program mode from the suspend mode to the execution mode. 
(See limitations as addressed in claims 8 and 9 above.) 

19. 

-the performance degrading execution path comprises a performance degrading instruction 

causing the cache miss event. 

(See limitations as addressed in claim 3 above.) 

20. 

-the at least one prefetch instruction is inserted prior to the performance degrading instruction. 
(See limitations as addressed in claim 6 above.) 



21. 
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-identifying the performance degrading execution path comprises determining whether a cache 
miss event of the plurality of cache miss events is an L2/L3 cache miss. 
(See limitations as addressed in claim 7 above.) 

22. 

-identifying the performance degrading path comprises identifying a performance degrading 
instruction contributing to highest degradation in the program performance. 
(See limitations as addressed in claim 4 above.) 

23. 

-modifying the performance degrading execution path comprises: 

-receiving information describing a dependency graph for a program degrading instruction 
contributing to highest occurrence of the plurality of cache miss events, the performance 
degrading instruction being included in the performance degrading execution path; 
([0036], "The loop restructuring routine begins. . .by performing use dependence analysis. . . and 
how the loop. . .may be distributed. . .dependence analysis will identify memory operations (cache 
miss events)...") 

-determining whether a cyclic dependency pattern exists in the graph; 

([0036], "... dependence analysis will identify memory operations that have some 

dependency. . . If there is. . . cyclic dependency. . . (determine whether a cyclic dependency pattern 

exists)") 
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-if the cyclic dependency pattern exists then, computing stride information derived from the 
cyclic dependency pattern; 

([0037], "...loop restructuring may be advantageously performed on memory arrays with longer 
strides. . .The particular organization and construction of the cache memory in the computer on 
which the compiled program is to be executed will influence the length of stride (stride 
information is considered) in an array that will benefit from loop restructuring.") 

-inserting the at least one prefetch instruction derived from the stride information, the at least one 
prefetch instruction being inserted into the optimized execution path prior to the performance 
degrading instruction. 

([0033], "...compiler operates to insert prefetch..." Also see [0051]. 
Also, note limitations as addressed in claim 10 above.) 

24. 

-the dependency graph is a backward slice from the performance degrading instruction. 
(See limitations as addressed in claim 1 1 above.) 

25. 

A method of optimizing a program, the method comprising: 

-receiving information describing a dependency graph for an instruction causing frequent cache 
misses, the instruction being included in the program; 
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([0036], . .performing use dependence analysis. . . " Hobbs suggested a dependency graph for 
instructions causing frequent cache misses: [0036] performing a 'use dependence analysis' to 
determine dependencies between memory operations / references. ) 

-determining whether a cyclic dependency pattern exists in the graph; 

([0036], "If there is (determine if it exists) . . .a cyclic dependency. . Hobbs gave consideration 
to cyclic dependencies at [0036] and disclosed that if a cyclic dependency exists (determining 
whether cyclic dependency pattern exists) then suggests optimal placement of instructions.) 

-if the cyclic dependency pattern exists then, computing stride information derived from the 
cyclic dependency pattern; 

([0037], ". . .length of stride in an array that will benefit from loop restructuring.") 

-inserting an at least one prefetch instruction derived from the stride information, the at 

least one prefetch instruction being inserted into the program prior to the instruction causing the 

frequent cache misses; 

([0051], "...the cache may be further optimized using, for example, cache management 
instructions. The computer may include a variety of prefetch and cache management instructions 

...") 



-reusing the at least one prefetch instruction in the program for reducing subsequent cache 
misses; 
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([0023], ". . .the executable code is executed by the process to generate the execution profile data 
that is used by the processes to determine whether the code, when executed, exhibits optimal 
execution performance, and if the code does not exhibit optimal execution performance, may be 
used by the processes to make more optimal versions of the code from which more optimal 
versions of the code may be generated (reuse for reducing subsequent cache misses) " 

-performing said receiving, said determining, said computing, said inserting and said reusing 
during runtime of the program. 

([0023], "". . . the executable code is executed (runtime) by the process to generate the execution 
profile data that is used by the processes to determine whether the code, when executed (runtime) 
, exhibits optimal execution performance. . . ") 

Also, see limitations addressed in claim 23 above. 

Hobbs disclosed an invention that [0002] "restructures program loops to reduce cache thrashing. 
Hobbs failed to provide specific details related to 'frequency of occurrence of a plurality of 
cache misses', but did broadly disclose that analysis was performed (including analysis of 
caches) and used in the restructuring of code. Therefore, it would have been obvious, to one of 
ordinary skill in the art, at the time of the invention, to modify Hobbs invention to specifically 
disclose that 'frequency of occurrence of a plurality of cache misses' was a consideration in the 
analysis, because he specifically discussed techniques to minimize cache thrashing caused by 
cache misses at [0025-0033]. 
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Response to Arguments 

6. Applicant has argued, in substance, the following: 

(A) As Applicant has noted on page 12, 4 th paragraph of Remarks received 6/16/2005, in 
reference to independent claims 1, 13, 26, and 27, Hobbs fails to disclose "collecting information 
describing a frequency of occurrence of a plurality of cache misses." 

Examiner's Response: Examiner disagrees. Hobbs invention is directed towards [0007-0009] 
reducing the effects of 'cache thrashing' as related to cache management / cache misses [0006- a 
portion not available when needed]. An analysis of cache thrashing / cache misses would 
inherently would include the frequency of cache misses. As an example of 'collecting 
information describing a frequency of occurrence of a plurality of cache misses' see [0022] 
where Hobbs disclosed "observing and analyzing (collecting information) the manner in which 
the code uses resources. . .cache (frequency of cache misses). . .The process then generates 
execution profile data. . ." Hobbs disclosed the intent to 'remove cache thrashing (due to cache 
misses) at [0033] by inserting prefetch and other cache management instructions. Another 
example at [0035] suggests loop restructuring in response to determining (from collected 
information of frequency of cache misses) that the memory references within the loop may cause 
cache thrashing (due to cache misses) to occur. [0066] discloses 'analyzing (collecting 
information) the program to determine where cache misses and/or cache thrashing is 
occurring..." 
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Hobbs disclosed an invention that [0002] "restructures program loops to reduce cache thrashing. 
Hobbs failed to provide specific details related to 'frequency of occurrence of a plurality of 
cache misses', but did broadly disclose that analysis was performed (including analysis of 
caches) and used in the restructuring of code. Regarding Therefore, it would have been obvious, 
to one of ordinary skill in the art, at the time of the invention, to modify Hobbs invention to 
specifically disclose that 'frequency of occurrence of a plurality of cache misses' was a 
consideration in the analysis, because he specifically discussed techniques to minimize cache 
thrashing caused by cache misses at [0025-0033]. 

(B) As Applicant has noted on page 13, 4 th paragraph, in reference to independent claim 25, 
Hobbs fails to disclose "receiving information describing a dependency graph for an instruction 
causing frequent cache misses, the instruction being included in the program; determining 
whether a cyclic dependency pattern exists in the graph." 

Examiner's Response: Examiner disagrees. Hobbs suggested a dependency graph for 
instructions causing frequent cache misses: [0036] performing a 'use dependence analysis' to 
determine dependencies between memory operations / references. Hobbs gave consideration to 
cyclic dependencies at [0036] and disclosed that if a cyclic dependency exists (determining 
whether cyclic dependency pattern exists) then suggests optimal placement of instructions. 

Examiner maintains rejections of claims 1-27. 
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Conclusion 



7. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1. 136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1 .136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the mailing 
date of this final action. 

8. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Mary Steelman, whose telephone number is (571) 272-3704. The 
examiner can normally be reached Monday through Thursday, from 7:00 AM to 5:30 PM If 
attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor, Tuan 
Q. Dam can be reached at (571) 272-3695. The fax phone number for the organization where 
this application or proceeding is assigned: 571-273-8300. 
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Any inquiry of a general nature or relating to the status of this application should be 
directed to the TC 2100 Group receptionist: 571-272-2100. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 



08/24/2005 



Mary Steelman 




sup' 



